encoder block
- Europe > United Kingdom (0.14)
- North America (0.04)
- North America > United States > Massachusetts > Middlesex County > Cambridge (0.06)
- North America > United States > Oregon > Multnomah County > Portland (0.05)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.05)
PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting
Zhao, Bowen, Xing, Huanlai, Xiao, Zhiwen, Peng, Jincheng, Feng, Li, Wang, Xinhan, Qu, Rong, Li, Hui
PeriodNet: Boosting the Potential of Attention Mechanism for Time Series Forecasting Bowen Zhao, Huanlai Xing, Zhiwen Xiao, Jincheng Peng, Li Feng, Xinhan Wang, Rong Qu, Hui Li The proposed PeriodNet hybridizes a period attention mechanism, an iterative grouping mechanism, and a period diffuser architecture to achieve accurate multivariate time series forecasting. The period attention mechanism captures temporal similarities among adjacent periods to improve time series modeling. The period diffuser architecture leverages multi-scale period features extracted by the encoder to enhance the accuracy and efficiency of time series forecasting. Abstract The attention mechanism has demonstrated remarkable potential in sequence modeling, exemplified by its successful application in natural language processing with models such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformer (GPT). Despite these advancements, its utilization in time series forecasting (TSF) has yet to meet expectations. Exploring a better network structure for attention in TSF holds immense significance across various domains. In this paper, we present PeriodNet with a brand new structure to forecast univari-ate and multivariate time series. PeriodNet incorporates period attention and sparse period attention mechanism for analyzing adjacent periods. It enhances the mining of local characteristics, periodic patterns, and global dependencies. For efficient cross-variable modeling, we introduce an iterative grouping mechanism which can directly reduce the cross-variable redundancy. To fully leverage the extracted features on the encoder side, we redesign the entire architecture of the vanilla Transformer and propose a period diffuser for precise multi-period prediction. Through comprehensive experiments conducted on eight datasets, we demonstrate that PeriodNet outperforms six state-of-the-art models in both univariate and multivariate TSF scenarios in terms of mean square error and mean absolute error.
- Europe > United Kingdom > England > Nottinghamshire > Nottingham (0.14)
- Europe > Austria > Vienna (0.14)
- North America > United States > New York > New York County > New York City (0.04)
- (5 more...)
Data-efficient U-Net for Segmentation of Carbide Microstructures in SEM Images of Steel Alloys
Gerçek, Alinda Ezgi, Korten, Till, Chekhonin, Paul, Hassan, Maleeha, Steinbach, Peter
Understanding reactor-pressure-vessel steel microstructure is crucial for predicting mechanical properties, as carbide precipitates both strengthen the alloy and can initiate cracks. In scanning electron microscopy images, gray-value overlap between carbides and matrix makes simple thresholding ineffective. We present a data-efficient segmentation pipeline using a lightweight U-Net (30.7~M parameters) trained on just \textbf{10 annotated scanning electron microscopy images}. Despite limited data, our model achieves a \textbf{Dice-Sørensen coefficient of 0.98}, significantly outperforming the state-of-the-art in the field of metallurgy (classical image analysis: 0.85), while reducing annotation effort by one order of magnitude compared to the state-of-the-art data efficient segmentation model. This approach enables rapid, automated carbide quantification for alloy design and generalizes to other steel types, demonstrating the potential of data-efficient deep learning in reactor-pressure-vessel steel analysis.
- Europe > Germany > Saxony > Dresden (0.05)
- South America > Peru > Loreto Department (0.04)
- North America > United States (0.04)
- Europe > Albania > Tirana County (0.04)
- Energy (0.55)
- Materials > Metals & Mining > Steel (0.41)
Google-MedGemma Based Abnormality Detection in Musculoskeletal radiographs
Maity, Soumyajit, Kamboj, Pranjal, Maity, Sneha, Singh, Rajat, Chatterjee, Sankhadeep
This paper proposes a MedGemma-based framework for automatic abnormality detection in musculoskeletal radiographs. Departing from conventional autoencoder and neural network pipelines, the proposed method leverages the MedGemma foundation model, incorporating a SigLIP-derived vision encoder pretrained on diverse medical imaging modalities. Preprocessed X-ray images are encoded into high-dimensional embeddings using the MedGemma vision backbone, which are subsequently passed through a lightweight multilayer perceptron for binary classification. Experimental assessment reveals that the MedGemma-driven classifier exhibits strong performance, exceeding conventional convolutional and autoencoder-based metrics. Additionally, the model leverages MedGemma's transfer learning capabilities, enhancing generalization and optimizing feature engineering. The integration of a modern medical foundation model not only enhances representation learning but also facilitates modular training strategies such as selective encoder block unfreezing for efficient domain adaptation. The findings suggest that MedGemma-powered classification systems can advance clinical radiograph triage by providing scalable and accurate abnormality detection, with potential for broader applications in automated medical image analysis. Keywords: Google MedGemma, MURA, Medical Image, Classification.
- North America > United States > Texas > Tarrant County > Arlington (0.05)
- Asia > India > West Bengal > Kolkata (0.04)
A Transformer-based Neural Architecture Search Method
Wang, Shang, Tang, Huanrong, Ouyang, Jianquan
This paper presents a neural architecture search method based on Transformer architecture, searching cross multihead attention computation ways for different number of encoder and decoder combinations. In order to search for neural network structures with better translation results, we considered perplexity as an auxiliary evaluation metric for the algorithm in addition to BLEU scores and iteratively improved each individual neural network within the population by a multi-objective genetic algorithm. Experimental results show that the neural network structures searched by the algorithm outperform all the baseline models, and that the introduction of the auxiliary evaluation metric can find better models than considering only the BLEU score as an evaluation metric.
- Europe > Portugal > Lisbon > Lisbon (0.06)
- Asia > China > Hunan Province (0.05)
- North America > United States > New York > New York County > New York City (0.04)
FASL-Seg: Anatomy and Tool Segmentation of Surgical Scenes
Abdel-Ghani, Muraam, Ali, Mahmoud, Ali, Mohamed, Ahmed, Fatmaelzahraa, Arsalan, Muhammad, Al-Ali, Abdulaziz, Balakrishnan, Shidin
The growing popularity of robotic minimally invasive surgeries has made deep learning-based surgical training a key area of research. A thorough understanding of the surgical scene components is crucial, which semantic segmentation models can help achieve. However, most existing work focuses on surgical tools and overlooks anatomical objects. Additionally, current state-of-the-art (SOT A) models struggle to balance capturing high-level contextual features and low-level edge features. We propose a Feature-Adaptive Spatial Localization model (FASL-Seg), designed to capture features at multiple levels of detail through two distinct processing streams, namely a Low-Level Feature Projection (LLFP) and a High-Level Feature Projection (HLFP) stream, for varying feature resolutions - enabling precise segmentation of anatomy and surgical instruments. We evaluated FASL-Seg on surgical segmentation benchmark datasets EndoVis18 and EndoVis17 on three use cases. The FASL-Seg model achieves a mean Intersection over Union (mIoU) of 72.71% on parts and anatomy segmentation in EndoVis18, improving on SOT A by 5%. It further achieves a mIoU of 85.61% and 72.78% in EndoVis18 and EndoVis17 tool type segmentation, respectively, outperforming SOT A overall performance, with comparable per-class SOT A results in both datasets and consistent performance in various classes for anatomy and instruments, demonstrating the effectiveness of distinct processing streams for varying feature resolutions.
- Health & Medicine > Surgery (1.00)
- Health & Medicine > Health Care Technology (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (0.69)